Fix a bug in extract_best_per_route kernel by rg20 · Pull Request #156 · NVIDIA/cuopt

rg20 · 2025-06-27T14:13:53Z

Description

The extract_best_per_route kernel does not use any dynamic shared memory, but we were launching it with sh_size, a value that is only relevant to the previous kernel in the code. In most cases this is harmless. However, if sh_size exceeds the shared memory available to the kernel, the launch fails with a cudaErrorInvalidValue error. The previous kernel is not affected because its dynamic shared memory is configured for that kernel with a cudaFuncSetAttribute call.

This PR passes zero as the dynamic shared memory size to resolve the issue.
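The failure mode can be reproduced outside cuOpt with a minimal sketch. The kernels and helpers below (needs_dynamic_smem, needs_no_smem, launch_opted_in, launch_plain, compare_launches) are hypothetical stand-ins, not the actual cuOpt code. The sketch assumes device 0 and uses the device's opt-in shared memory limit as the oversized sh_size.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the earlier kernel that really uses dynamic shared memory.
__global__ void needs_dynamic_smem(int* out)
{
  extern __shared__ int buf[];
  buf[threadIdx.x] = static_cast<int>(threadIdx.x);
  __syncthreads();
  if (threadIdx.x == 0) { *out = buf[blockDim.x - 1]; }
}

// Hypothetical stand-in for a kernel that uses no dynamic shared memory at all.
__global__ void needs_no_smem(int* out)
{
  if (threadIdx.x == 0) { *out = 1; }
}

// Opt the first kernel in to sh_size bytes, launch it, and return the launch status.
cudaError_t launch_opted_in(int* d_out, int sh_size)
{
  cudaError_t err = cudaFuncSetAttribute(
    needs_dynamic_smem, cudaFuncAttributeMaxDynamicSharedMemorySize, sh_size);
  if (err != cudaSuccess) { return err; }
  needs_dynamic_smem<<<1, 32, sh_size>>>(d_out);
  return cudaGetLastError();
}

// Launch the second kernel with the given dynamic shared size and return the launch status.
// No cudaFuncSetAttribute call is made for this kernel.
cudaError_t launch_plain(int* d_out, int sh_size)
{
  needs_no_smem<<<1, 32, sh_size>>>(d_out);
  return cudaGetLastError();
}

// Run the three launches on device 0 and print the status of each.
void compare_launches()
{
  int default_limit = 0;
  int optin_limit   = 0;
  cudaDeviceGetAttribute(&default_limit, cudaDevAttrMaxSharedMemoryPerBlock, 0);
  cudaDeviceGetAttribute(&optin_limit, cudaDevAttrMaxSharedMemoryPerBlockOptin, 0);
  if (optin_limit <= default_limit) {
    std::printf("device has no opt-in shared memory range; nothing to compare\n");
    return;
  }

  int* d_out = nullptr;
  if (cudaMalloc(&d_out, sizeof(int)) != cudaSuccess) { return; }

  // Kernel tuned with cudaFuncSetAttribute: the large size is accepted.
  std::printf("opted-in kernel, sh_size=%d: %s\n", optin_limit,
              cudaGetErrorString(launch_opted_in(d_out, optin_limit)));
  // Untuned kernel, same sh_size: this is the launch pattern the PR removes.
  std::printf("plain kernel,    sh_size=%d: %s\n", optin_limit,
              cudaGetErrorString(launch_plain(d_out, optin_limit)));
  // Untuned kernel with zero dynamic shared memory: the pattern the PR adopts.
  std::printf("plain kernel,    sh_size=0: %s\n",
              cudaGetErrorString(launch_plain(d_out, 0)));

  cudaDeviceSynchronize();
  cudaFree(d_out);
}
```

Calling compare_launches() from a main function and compiling with nvcc should show the second launch rejected on devices whose opt-in limit exceeds the default per-block limit, while the first and third launches are accepted. On devices without an opt-in range the sketch prints a note and returns.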

Issue

This is a bug reported by a customer.

Checklist

- I am familiar with the Contributing Guidelines.
- Testing
  - New or existing tests cover these changes
  - Added tests
  - Created an issue to follow-up
  - NA
- Documentation
  - The documentation is up to date with these changes
  - Added new documentation
  - NA

copy-pr-bot · 2025-06-27T14:13:58Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

rg20 · 2025-06-27T14:18:59Z

/ok to test e5ff891

rg20 · 2025-06-27T18:31:01Z

/merge

Do not pass dynamic shared memory size for extract_best_per_route kernel

1c3cb7f

rg20 requested a review from a team as a code owner June 27, 2025 14:13

rg20 requested review from akifcorduk and chris-maes June 27, 2025 14:13

rg20 added the bug (Something isn't working) and non-breaking (Introduces a non-breaking change) labels Jun 27, 2025

rg20 added this to the 25.08 milestone Jun 27, 2025

Revert unnecessary change

e5ff891

hlinsen approved these changes Jun 27, 2025

rapids-bot bot merged commit 89aef1b into NVIDIA:branch-25.08 Jun 27, 2025
141 of 142 checks passed

rapids-bot[bot] merged 2 commits into NVIDIA:branch-25.08 from rg20:multiple_insert_bug_fix